Nonlinear filtering for speaker tracking in noisy and reverberant environments

Authors

  • Jaco Vermaak
  • Andrew Blake
Abstract

This paper addresses the problem of speaker tracking in a noisy and reverberant environment using time delay of arrival (TDOA) measurements at spatially distributed microphone pairs. The tracking problem is posed within a state-space estimation framework, and models are developed for the speaker motion and for the likelihood of the speaker location given the TDOA measurements. The resulting state-space model is non-linear and non-Gaussian, so no closed-form solutions exist for the filtering distributions required to perform tracking. Instead, Sequential Monte Carlo (SMC) methods are applied to approximate the true filtering distribution with a set of samples. The resulting tracking algorithm requires no triangulation, is computationally efficient, and can straightforwardly be extended to track multiple speakers.
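
The approach summarized in the abstract can be illustrated with a minimal bootstrap particle filter. The sketch below is not the authors' implementation: the random-walk motion model, the Gaussian-plus-uniform likelihood, the room and microphone geometry, and every name and parameter value (`tdoa`, `likelihood`, `filter_step`, `run_demo`, `sigma`, `clutter`, `step_std`) are assumptions chosen only to show how samples can be weighted directly by TDOA measurements without triangulating a position first.

```python
import math
import random

SPEED_OF_SOUND = 343.0  # m/s; assumed value for air at room temperature


def tdoa(pos, mic_pair):
    """Noise-free TDOA (seconds) at one microphone pair for a source at pos."""
    mic_a, mic_b = mic_pair
    return (math.dist(pos, mic_a) - math.dist(pos, mic_b)) / SPEED_OF_SOUND


def likelihood(pos, measured, mic_pairs, sigma=2e-4, clutter=0.1, max_tdoa=3e-3):
    """Hypothetical likelihood: per pair, a Gaussian around the predicted TDOA
    mixed with a uniform 'clutter' term standing in for reverberation outliers."""
    value = 1.0
    for tau, pair in zip(measured, mic_pairs):
        err = tau - tdoa(pos, pair)
        gauss = math.exp(-0.5 * (err / sigma) ** 2) / (sigma * math.sqrt(2.0 * math.pi))
        value *= (1.0 - clutter) * gauss + clutter / (2.0 * max_tdoa)
    return value


def filter_step(particles, measured, mic_pairs, rng, step_std=0.1):
    """One predict / weight / resample cycle of a bootstrap particle filter."""
    # Predict: random-walk motion model (an assumption made for this sketch).
    moved = [(x + rng.gauss(0.0, step_std), y + rng.gauss(0.0, step_std))
             for x, y in particles]
    # Weight: evaluate the TDOA likelihood directly; no triangulation is needed.
    weights = [likelihood(p, measured, mic_pairs) for p in moved]
    total = sum(weights)  # strictly positive because of the clutter floor
    weights = [w / total for w in weights]
    # Estimate: weighted mean of the particle positions.
    estimate = (sum(w * p[0] for w, p in zip(weights, moved)),
                sum(w * p[1] for w, p in zip(weights, moved)))
    # Resample: multinomial resampling proportional to the weights.
    resampled = rng.choices(moved, weights=weights, k=len(moved))
    return resampled, estimate


def run_demo(seed=0, n_particles=2000, n_steps=30):
    """Track a simulated speaker in a 4 m x 4 m room; returns (truth, estimate)."""
    rng = random.Random(seed)
    # One microphone pair (0.5 m spacing) centred on each wall; made-up geometry.
    mic_pairs = [((1.75, 0.0), (2.25, 0.0)), ((4.0, 1.75), (4.0, 2.25)),
                 ((1.75, 4.0), (2.25, 4.0)), ((0.0, 1.75), (0.0, 2.25))]
    particles = [(rng.uniform(0.0, 4.0), rng.uniform(0.0, 4.0))
                 for _ in range(n_particles)]
    truth, estimate = (1.0, 1.0), None
    for _ in range(n_steps):
        truth = (truth[0] + 0.06, truth[1] + 0.04)  # simulated speaker motion
        measured = [tdoa(truth, pair) + rng.gauss(0.0, 1e-4) for pair in mic_pairs]
        particles, estimate = filter_step(particles, measured, mic_pairs, rng)
    return truth, estimate


if __name__ == "__main__":
    truth, estimate = run_demo()
    print("true position:", truth)
    print("estimate:     ", estimate)
```

Each particle is a candidate speaker position, so tracking several speakers would amount to enlarging the state each particle carries. The uniform mixture component keeps a single spurious TDOA from driving all weights to zero, which is one simple way to tolerate reverberation-induced outliers in a sketch like this.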

Similar resources

Adaptive beamforming combined with particle filtering for acoustic source localization

While the main objective of adaptive Filter-and-Sum beamforming is to obtain an enhanced speech signal for subsequent processing like speech recognition, we show how speaker localization information can be derived from the filter coefficients. To increase localization accuracy, speaker tracking is performed by non-linear Bayesian state estimation, which is realized by sequential Monte Carlo met...

Verified speaker localization utilizing voicing level in split-bands

This paper proposes a joint verification-localization structure based on split-band analysis of the speech signal and the mixed voicing level. To address the problems in reverberant acoustic environments, a new fundamental frequency estimation algorithm is proposed based on high resolution spectral estimation. In the reconstruction of the distorted speech this information is utilized to reduce the ...

Speaker tracking with a microphone array using Kalman filtering (Advances in Radio Science)

In this publication a method for tracking a speaker with acoustical information by means of a microphone array is presented. A sound source localization algorithm based on the time delays of arrival of sound waves in microphone pairs provides initial position estimates. These significantly varying estimates are spatially filtered by an adaptive Kalman filter to obtain a smoothed trajectory of t...

Increasing robustness in GMM speaker recognition systems for noisy and reverberant speech with low complexity microphone arrays

In this paper we describe the additive robustness obtained through the combined use of a first acoustic processing step based on a low complexity microphone array, followed by a spectral normalization step. Microphone arrays have been shown to provide good results in reducing different sources of acoustic degradation. However, microphone arrays produce linear filtering effects that need to be compen...

Concurrent speaker localization using multi-band position-pitch (M-PoPi) algorithm with spectro-temporal pre-processing

Accurate, microphone-based speaker localization in real-world environments, like office spaces or meeting rooms, must be able to track a single speaker and multiple concurrent speakers in the presence of reverberations and background noise. Our Multiband Joint Position-Pitch (M-PoPi) algorithm for circular microphone arrays already shows a frame-wise localization estimation score of about 95% f...

Publication date: 2001